Abstract

We give a new, elementary, purely analytical development of Descartes' theorem 
that a smooth connected surface is a perfect focusing lens if and only if it is a connected 
subset of the ovoid obtained by revolving a cartesian oval around its axis of symmetry. 

1 Introduction

Almost two thousand three hundred years ago, the Hellenistic mathematician Diokles [1]
gave the first proof that a mirror in the shape of a paraboloid of revolution reflects all incident
light rays that are parallel to its axis of symmetry to a single point, which Kepler [8],
in 1604, called the focus.

After the advent of the calculus it was possible to prove that the only such reflecting
surface is generated by revolving a (proper or degenerate) parabola around its axis of symmetry.
This is a very famous and well-known result, and is treated in many easily accessible
sources. See, for example, Spiegel [16].

All these proofs are based on Heron's Law of Reflection, $\theta_I = \theta_{Rf}$, where $\theta_{Rf}$ is
the angle the reflected ray makes with the normal to the reflecting surface at the point of
incidence of the incoming ray and $\theta_I$ is the angle the incident ray makes with the normal.
One transforms Heron's equation into the ordinary differential equation (ODE) of
the cross-section curve of the mirror. Drucker's paper [5] would seem to be the final word
on the subject.

Unfortunately and surprisingly, the corresponding result for a lens, instead of a mirror,
is less well-known, at least among mathematicians (although [11] is a pleasant attempt to alter
that). Yet the case of a lens, too, is quite fascinating and is treatable by elementary means.
The purpose of this paper is to fill this gap.

Indeed, it all started in 1637, when Descartes [3] asked for the refractive analogue of 
the parabolic mirror: 

Which shape of lens will focus all rays from one radiant point source to one 
single image point? 

We will call such a lens a perfect lens. 

Descartes discovered that the cross-section curve of the perfect lens, assumed to be 
a surface of revolution, is a fourth degree curve known today as the cartesian oval. It 
can be defined as the locus of points the "weighted" sum of whose distances from two fixed 
points is a constant: 

\[
d_1 + n d_2 = c \tag{1.0.1}
\]

where $d_1$ and $d_2$ are the distances from any point on the curve to the two fixed points, called
the foci, and n is a constant. If one focus is at the origin and the other is at the point (b, 0),
where $b \neq 0$, the equation can be written:



\[
\left[(1 - n^2)(x^2 + y^2) + 2n^2 b x + c^2 - n^2 b^2\right]^2 = 4c^2 (x^2 + y^2) \tag{1.0.2}
\]



If n = ±1, then $n^2 = 1$ and (1.0.2) reduces to the conic section:

\[
\left(2bx + c^2 - b^2\right)^2 = 4c^2 (x^2 + y^2) \tag{1.0.3}
\]



More information on cartesian ovals can be found in [14] and [15] and [18] . 
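
As a quick numerical sanity check (not part of the original argument), the following Python sketch locates points with $d_1 + n d_2 = c$ by bisection along rays from the origin and confirms that they satisfy the rationalized quartic (1.0.2). The sample values n = 1.5, b = 1, c = 3 and the helper names `oval_residual` and `point_on_oval` are illustrative assumptions.

```python
import math

def oval_residual(x, y, n, b, c):
    """Left-hand side minus right-hand side of the quartic (1.0.2)."""
    r2 = x * x + y * y
    lhs = ((1 - n**2) * r2 + 2 * n**2 * b * x + c**2 - n**2 * b**2) ** 2
    return lhs - 4 * c**2 * r2

def point_on_oval(angle, n, b, c):
    """Point on the ray from the origin at the given polar angle with d1 + n*d2 = c.

    Uses bisection on the radius d1; assumes n*b < c so that the defining
    function is negative at radius 0 and positive at radius c.
    """
    def g(r):
        x, y = r * math.cos(angle), r * math.sin(angle)
        return r + n * math.hypot(x - b, y) - c

    lo, hi = 0.0, c
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0:
            hi = mid
        else:
            lo = mid
    r = 0.5 * (lo + hi)
    return r * math.cos(angle), r * math.sin(angle)

if __name__ == "__main__":
    for k in range(8):
        x, y = point_on_oval(0.3 * k, 1.5, 1.0, 3.0)
        print(f"({x:.6f}, {y:.6f})  residual = {oval_residual(x, y, 1.5, 1.0, 3.0):.2e}")
```

The residuals should be zero up to rounding error, since (1.0.2) is obtained from (1.0.1) by isolating one square root and squaring twice.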

Descartes' own treatment, which is not altogether easy to read (see pQ), shows that 
the oval is a solution, but does not show that it is the only solution. 

The only treatments of Descartes' result that we have seen in the literature do not
appear in books on mathematics (!), but rather on optics (see Hecht [7] and Klein [9])
and use Fermat's Principle: A light ray traverses the path between two points which takes
the least time.

A non-trivial computation, based on the calculus of variations, shows that the time the
light ray takes to go from the radiant point to the image point is constant for every point
of the cross-section curve of the perfect lens (since if the time were different at two points of
the curve, it would not be minimal), and therefore its equation is that of the cartesian oval.

Moreover such treatments make physical assumptions about the velocity of light in different
media, while, as our treatment will show, the problem is really one in pure mathematics.

We have not seen any treatment of the subject which is founded purely on Snell's law
of refraction, which describes the relationship between the angle of incidence and the angle
of refraction when light passes the boundary between two isotropic media (media in which
the path of a light ray is a straight line). The law states:

If $\theta_I$ is the angle the incident ray makes with the normal to the boundary at the
point of refraction, and if $\theta_R$ is the angle the refracted ray makes with the normal,
then at all points of the boundary the ratio

\[
\frac{\sin\theta_I}{\sin\theta_R} = n,
\]

where n, called the index of refraction, is constant.

Such a treatment of Descartes' theorem would seem desirable, since it is the immediate
generalization of the corresponding treatment of the perfect reflective mirror.

In this paper, we will present a new, self-contained, elementary, purely analytical proof, 
based on Snell's Law and Drucker's paper [5], of the following complete form of Descartes' 
theorem: 

Theorem 1. (Descartes' Theorem) A smooth connected surface is a perfect lens if and
only if it is a connected subset of the ovoid obtained by revolving a cartesian oval around its
axis of symmetry.

□ 

2 The Analytical Problem (in two dimensions) 

We begin by solving the following two-dimensional purely analytical problem: 

It is required to find the equation, f(x,y) = 0, of a smooth connected curve, C, 
for which the straight lines from two fixed points cut the normal in two angles 
whose sines are in constant ratio. 

Please note the absence of physical modeling. The problem is purely mathematical, as is
its solution.

We will find and solve an ordinary differential equation (ODE) for which the equation
of the curve is the general solution. The ODE, in fact, will be a restatement of Snell's
law.

Definition 1. We call any curve C that solves the problem a perfect two-dimensional
lens with respect to the points F and F'.



2.1 Both Fixed Points Are Finite 
2.1.1 The Differential Equation 

We assume a cartesian coordinate system in the xy-plane. 

Let the two fixed points be O(0, 0) and B(b, 0) with b > 0 (here we use the assumption
that F and F' are finite and distinct). Let P(x, y) be a variable point on the curve f(x, y) = 0.
We assume P is in the first quadrant and we assume that the curve is concave downwards
at P (that is, if y(x) is the function defined implicitly by the equation f(x, y) = 0, then
$y''(x) < 0$). Let $l_1$ be the length of the line segment OP and let $l_2$ be the length of PB.
Let MN be the normal to the curve where N is on the concave side of the curve and P is
between M and N. Let $\theta_1 := \angle MPO$, the angle that OP forms with the normal MN, and let
$\theta_2 := \angle BPN$, the angle that BP forms with the normal MN. Let PT be the tangent line
to the curve at P where T is the point on the x-axis where the tangent line crosses it. Let
$\phi := \angle PTB$, the angle, measured counterclockwise, that the tangent line forms with the x-axis.

We will use the geometry of the figure to obtain formulas for $\sin\theta_1$ and $\sin\theta_2$ in terms
of x, y, and the derivative, y'. When we substitute these expressions into Snell's Law, we
obtain the desired differential equation for the curve f(x, y) = 0.

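By the law of cosines applied twice to the triangle $\triangle OPB$,

\[
\cos \angle POB = \frac{b^2 + l_1^2 - l_2^2}{2 l_1 b},
\qquad
\cos \angle PBO = \frac{b^2 + l_2^2 - l_1^2}{2 l_2 b}.
\]

But

\[
\begin{aligned}
\cos \angle POB &= \cos(\theta_1 + \phi - 90^\circ) = \sin(\theta_1 + \phi), \\
\cos \angle PBO &= \cos(90^\circ - \theta_2 - \phi) = \sin(\theta_2 + \phi).
\end{aligned}
\]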

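So, we obtain our fundamental formulas:

\[
\sin(\theta_1 + \phi) = \frac{b^2 + l_1^2 - l_2^2}{2 l_1 b},
\qquad
\sin(\theta_2 + \phi) = \frac{b^2 + l_2^2 - l_1^2}{2 l_2 b}.
\tag{2.1.1}
\]

Moreover, it is evident that

\[
\frac{\pi}{2} < \theta_1 + \phi < \pi,
\qquad
0 < \theta_2 + \phi < \frac{\pi}{2}.
\tag{2.1.2}
\]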
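
Now, since $l_1^2 = x^2 + y^2$ and $l_2^2 = (b - x)^2 + y^2$,

\[
\frac{b^2 + l_1^2 - l_2^2}{2 l_1 b} = \frac{2bx}{2 l_1 b} = \frac{x}{l_1},
\qquad
\frac{b^2 + l_2^2 - l_1^2}{2 l_2 b} = \frac{2b^2 - 2bx}{2 l_2 b} = \frac{b - x}{l_2}.
\]

Therefore, equations (2.1.1) become

\[
\sin(\theta_1 + \phi) = \frac{x}{l_1},
\qquad
\sin(\theta_2 + \phi) = \frac{b - x}{l_2}.
\]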
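
The simplification of the law-of-cosines expressions is easy to confirm numerically. The sketch below is an illustrative check only; the function names and the sample points are assumptions, not part of the paper.

```python
import math

def cosine_law_sines(x, y, b):
    """sin(theta1 + phi) and sin(theta2 + phi) from the law of cosines in triangle OPB."""
    l1 = math.hypot(x, y)        # |OP|
    l2 = math.hypot(b - x, y)    # |PB|
    s1 = (b * b + l1 * l1 - l2 * l2) / (2 * l1 * b)
    s2 = (b * b + l2 * l2 - l1 * l1) / (2 * l2 * b)
    return s1, s2

def simplified_sines(x, y, b):
    """The simplified forms x / l1 and (b - x) / l2."""
    return x / math.hypot(x, y), (b - x) / math.hypot(b - x, y)

if __name__ == "__main__":
    for x, y, b in [(0.3, 0.8, 1.0), (0.7, 0.2, 2.5)]:
        print(cosine_law_sines(x, y, b), simplified_sines(x, y, b))
```

Both functions should return the same pair of values up to rounding for any point P(x, y) with y > 0 and any b > 0.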



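By (2.1.1) and (2.1.2) and the definition of the arcsin function, we obtain

\[
\pi - (\theta_1 + \phi) = \arcsin\left(\frac{x}{l_1}\right),
\qquad
\theta_2 + \phi = \arcsin\left(\frac{b - x}{l_2}\right),
\]

and therefore

\[
\theta_1 = (\pi - \phi) - \arcsin\left(\frac{x}{l_1}\right),
\qquad
\theta_2 = \arcsin\left(\frac{b - x}{l_2}\right) - \phi,
\]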

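whence, using $\tan\phi = y'$,

\[
\begin{aligned}
\sin\theta_1 &= \sin(\pi - \phi)\cos\left\{\arcsin\left(\frac{x}{l_1}\right)\right\}
              - \cos(\pi - \phi)\sin\left\{\arcsin\left(\frac{x}{l_1}\right)\right\} \\
             &= \sin\phi\,\sqrt{1 - \frac{x^2}{l_1^2}} + \cos\phi\,\frac{x}{l_1} \\
             &= \frac{y'}{\sqrt{1 + y'^2}}\sqrt{1 - \frac{x^2}{l_1^2}} + \frac{1}{\sqrt{1 + y'^2}}\,\frac{x}{l_1}
\end{aligned}
\]

and

\[
\begin{aligned}
\sin\theta_2 &= \cos\phi\,\sin\left\{\arcsin\left(\frac{b - x}{l_2}\right)\right\}
              - \sin\phi\,\cos\left\{\arcsin\left(\frac{b - x}{l_2}\right)\right\} \\
             &= \frac{1}{\sqrt{1 + y'^2}}\,\frac{b - x}{l_2}
              - \frac{y'}{\sqrt{1 + y'^2}}\sqrt{1 - \frac{(b - x)^2}{l_2^2}}.
\end{aligned}
\]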
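
The two sine formulas can be cross-checked numerically. The sketch below computes $\theta_1$ and $\theta_2$ directly from the arcsin expressions and compares their sines with the closed forms $(x + yy')/(l_1\sqrt{1+y'^2})$ and $((b-x) - yy')/(l_2\sqrt{1+y'^2})$, which follow from the expansions above because $\sqrt{1 - x^2/l_1^2} = y/l_1$ and $\sqrt{1 - (b-x)^2/l_2^2} = y/l_2$ for y > 0. It assumes $\tan\phi = y'$; the function names and sample values are illustrative.

```python
import math

def sines_via_angles(x, y, yp, b):
    """sin(theta1), sin(theta2) computed from the arcsin expressions for the angles."""
    l1 = math.hypot(x, y)
    l2 = math.hypot(b - x, y)
    phi = math.atan(yp)                       # assumption: tan(phi) = y'
    theta1 = (math.pi - phi) - math.asin(x / l1)
    theta2 = math.asin((b - x) / l2) - phi
    return math.sin(theta1), math.sin(theta2)

def sines_closed_form(x, y, yp, b):
    """Closed forms in terms of x, y and y' (valid for y > 0)."""
    l1 = math.hypot(x, y)
    l2 = math.hypot(b - x, y)
    root = math.sqrt(1 + yp * yp)
    return (x + y * yp) / (l1 * root), ((b - x) - y * yp) / (l2 * root)

if __name__ == "__main__":
    print(sines_via_angles(0.4, 0.9, -0.3, 1.0))
    print(sines_closed_form(0.4, 0.9, -0.3, 1.0))
```

The two printed pairs should agree to rounding error.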
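
Now, our assumption is that Snell's Law holds, i.e., that

\[
\frac{\sin\theta_1}{\sin\theta_2} = n,
\]

where n is a constant, holds for every point P(x, y) of the curve. Substituting our two
formulas for $\sin\theta_1$ and $\sin\theta_2$ into this equation gives us the equation:

\[
\frac{\dfrac{y'}{\sqrt{1 + y'^2}}\sqrt{1 - \dfrac{x^2}{l_1^2}} + \dfrac{1}{\sqrt{1 + y'^2}}\,\dfrac{x}{l_1}}
     {\dfrac{1}{\sqrt{1 + y'^2}}\,\dfrac{b - x}{l_2} - \dfrac{y'}{\sqrt{1 + y'^2}}\sqrt{1 - \dfrac{(b - x)^2}{l_2^2}}}
= n.
\tag{2.1.3}
\]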



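Solving this equation (2.1.3) for y', and using $\sqrt{1 - x^2/l_1^2} = y/l_1$ and
$\sqrt{1 - (b - x)^2/l_2^2} = y/l_2$ (recall y > 0), we obtain the differential equation of the curve:

\[
y' = \frac{\dfrac{n(b - x)}{l_2} - \dfrac{x}{l_1}}{y\left(\dfrac{1}{l_1} + \dfrac{n}{l_2}\right)}.
\tag{2.1.4}
\]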



2.1.2 The Solution of the Differential Equation 

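We use the "arrow" notation. "$P \Rightarrow Q$" means "the proposition P (logically) implies the
proposition Q."

\[
\begin{aligned}
& y' = \frac{\dfrac{n(b - x)}{l_2} - \dfrac{x}{l_1}}{y\left(\dfrac{1}{l_1} + \dfrac{n}{l_2}\right)} \\
\Rightarrow\ & \frac{yy'}{l_1} + \frac{nyy'}{l_2} = \frac{n(b - x)}{l_2} - \frac{x}{l_1} \\
\Rightarrow\ & \frac{yy' + x}{l_1} + n\,\frac{-(b - x) + yy'}{l_2} = 0 \\
\Rightarrow\ & \frac{yy' + x}{\sqrt{x^2 + y^2}} + n\,\frac{-(b - x) + yy'}{\sqrt{(b - x)^2 + y^2}} = 0 \\
\Rightarrow\ & \frac{1}{2}\,\frac{2yy' + 2x}{\sqrt{x^2 + y^2}} + n \cdot \frac{1}{2}\,\frac{-2(b - x) + 2yy'}{\sqrt{(b - x)^2 + y^2}} = 0 \\
\Rightarrow\ & \frac{d}{dx}\left[\sqrt{x^2 + y^2} + n\sqrt{(b - x)^2 + y^2}\right] = 0 \\
\Rightarrow\ & \sqrt{x^2 + y^2} + n\sqrt{(b - x)^2 + y^2} = c
\end{aligned}
\]

for some (arbitrary) constant c. We have therefore proved: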

Theorem 2. The general solution for the differential equation (2.1.4) of the perfect two-dimensional
lens, C, with respect to the points F = (0, 0) and F' = (b, 0), where $b \neq 0$, is
given by the equation:

\[
\sqrt{x^2 + y^2} + n\sqrt{(b - x)^2 + y^2} = c. \tag{2.1.5}
\]

□



As we saw, this is the equation (1.0.1) of a cartesian oval with foci at the points (0, 0)
and (b, 0).
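
The statement can be illustrated numerically. The following sketch integrates the differential equation, written in the form $(x + yy')/l_1 = n\,((b - x) - yy')/l_2$ and solved for y', with a classical fourth-order Runge-Kutta scheme, and checks that the weighted distance sum $\sqrt{x^2+y^2} + n\sqrt{(b-x)^2+y^2}$ stays constant along the computed curve. The starting point, the value n = 1.5, the step count and the function names are illustrative assumptions.

```python
import math

def invariant(x, y, n, b):
    """Weighted distance sum d1 + n*d2 of the cartesian oval."""
    return math.hypot(x, y) + n * math.hypot(b - x, y)

def slope(x, y, n, b):
    """y' from (x + y*y')/l1 = n*((b - x) - y*y')/l2, solved for y' (needs y != 0)."""
    l1 = math.hypot(x, y)
    l2 = math.hypot(b - x, y)
    return (n * (b - x) / l2 - x / l1) / (y * (1 / l1 + n / l2))

def rk4(x, y, x_end, steps, n, b):
    """Classical RK4 integration of y' = slope(x, y) from x to x_end."""
    h = (x_end - x) / steps
    for _ in range(steps):
        k1 = slope(x, y, n, b)
        k2 = slope(x + h / 2, y + h * k1 / 2, n, b)
        k3 = slope(x + h / 2, y + h * k2 / 2, n, b)
        k4 = slope(x + h, y + h * k3, n, b)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        x += h
    return x, y

if __name__ == "__main__":
    n, b = 1.5, 1.0
    x0, y0 = 0.5, 1.0
    x1, y1 = rk4(x0, y0, 1.0, 500, n, b)
    print(invariant(x0, y0, n, b), invariant(x1, y1, n, b))
```

The two printed values should agree up to the integration error, which reflects that the solution curves of the ODE are level sets of $d_1 + n d_2$.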

We have assumed that b is finite in this analysis, i.e., that the two foci are a finite distance 
apart. 

Now we consider the limiting cases where one or both foci are "at infinity." We will see 
that we obtain proper or degenerate conic sections for these cases. 

2.2 One Fixed Point At Infinity; One Fixed Point Finite 
2.2.1 The Differential Equation 

We will slightly alter the treatment for the case of two finite foci. To do so, we begin with 
the following: 

Definition 2. A point at infinity is specified by means of a line through the origin. The 
line joining P to a point at infinity is the line through P parallel to the given line. 
Points at infinity are not considered to be on C. 

We assume that the fixed point F is at $-\infty$ along the x-axis and that the fixed point F'
is at the point (b, 0) of the x-axis, where $b \neq 0$.

Intuitively, this means that a beam of light from $-\infty$, parallel to the x-axis, is brought to
a point focus at (b, 0) by a single refracting curve, f(x, y) = 0, of index n.

The line joining P to the point at infinity is the line parallel to the x-axis through P. $\theta_1$
is the angle the horizontal line through P(x, y) makes with the normal, while $\theta_2$ is the angle
PF' makes with the normal. Finally, let $l$ be the length of PF'.

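Then, the earlier derivation of the ODE is applicable. We need only observe that the incident
ray is now horizontal, so that $\theta_1 + \phi = \pi/2$ and

\[
\sin\theta_1 = \cos\phi = \frac{1}{\sqrt{1 + y'^2}},
\qquad
\sin\theta_2 = \frac{1}{\sqrt{1 + y'^2}}\,\frac{b - x}{l} - \frac{y'}{\sqrt{1 + y'^2}}\,\frac{y}{l}.
\]

So, substituting our two new formulas for $\sin\theta_1$ and $\sin\theta_2$ into Snell's Law gives us, instead
of (2.1.3), the new equation:

\[
\frac{\dfrac{1}{\sqrt{1 + y'^2}}}
     {\dfrac{1}{\sqrt{1 + y'^2}}\,\dfrac{b - x}{l} - \dfrac{y'}{\sqrt{1 + y'^2}}\,\dfrac{y}{l}}
= n.
\]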




After some rearrangement, we obtain the differential equation of the curve:

\[
1 - \frac{(b - x) - yy'}{\sqrt{(b - x)^2 + y^2}} \cdot n = 0.
\tag{2.2.1}
\]

2.2.2 The Solution of the Differential Equation

The ODE (2.2.1) can be solved by the same computations as we did for the ODE (2.1.4),
which leads us to:

Theorem 3. The general solution for the differential equation (2.2.1) of the perfect two-dimensional
lens, C, with the radiant point at $-\infty$ is given by the equation:

\[
x + n\sqrt{(b - x)^2 + y^2} = c, \tag{2.2.2}
\]

where c is an arbitrary constant.

□



We observe that the equation (2.2.2) has the following interesting interpretation. The
equation (2.2.2) says that the ratio of the distance of the point P from the line x = c to
its distance from the point (b, 0) is the constant ±n, and therefore, by the focus-directrix
definition, the curve is a conic section.

This theorem takes a more elegant form if we assume that the curve C passes through the
origin. Then, the constant c = nb and, after rationalizing (2.2.2), we obtain ([10], problem
B-10, Chapter 20):



Theorem 4. The general solution for the differential equation (2.2.1) of the perfect two-dimensional
lens, C, is a conic section whose focus is the point where the light is focused
and whose eccentricity is the reciprocal of the index of refraction.

1. If $n^2 \neq 1$, C is given by the equation:

\[
\frac{\left(x - \dfrac{nb}{n + 1}\right)^2}{\left(\dfrac{nb}{n + 1}\right)^2}
+ \frac{y^2}{\dfrac{n - 1}{n + 1}\,b^2} = 1.
\tag{2.2.3}
\]

Therefore C is an ellipse if $n^2 > 1$ or a hyperbola if $n^2 < 1$, either one of which is
centered at $\left(\dfrac{nb}{n + 1}, 0\right)$.

2. If n = 1, then C is the segment of the x-axis given by $0 \le x \le b$.

3. If n = -1, then C is the parabola

\[
y^2 = 4bx. \tag{2.2.4}
\]

□
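
For n > 1 the theorem can be checked numerically on the ellipse with center $(nb/(n+1), 0)$, semi-major axis $nb/(n+1)$ and squared semi-minor axis $b^2(n-1)/(n+1)$, which is what rationalizing $x + n\sqrt{(b-x)^2+y^2} = nb$ gives. The sketch below verifies the unsquared equation on sample points and computes the ratio of the sines of the angles that the horizontal ray and the ray to (b, 0) make with the normal. The values n = 1.5, b = 1 and the function names are illustrative assumptions.

```python
import math

def ellipse_point(t, n, b):
    """Point of the ellipse for parameter t (requires n > 1)."""
    a = n * b / (n + 1)                      # semi-major axis; the center is (a, 0)
    q = b * math.sqrt((n - 1) / (n + 1))     # semi-minor axis
    return a + a * math.cos(t), q * math.sin(t)

def sine_ratio(x, y, n, b):
    """sin(theta1) / sin(theta2) at the point (x, y) of the ellipse."""
    a = n * b / (n + 1)
    q2 = b * b * (n - 1) / (n + 1)
    nx, ny = (x - a) / (a * a), y / q2       # direction of the gradient, i.e. of the normal
    norm = math.hypot(nx, ny)
    nx, ny = nx / norm, ny / norm
    l = math.hypot(b - x, y)
    rx, ry = (b - x) / l, -y / l             # unit vector from P towards the focus (b, 0)
    sin1 = abs(ny)                           # |cross((1, 0), normal)| for the horizontal ray
    sin2 = abs(rx * ny - ry * nx)            # |cross(ray to focus, normal)|
    return sin1 / sin2

if __name__ == "__main__":
    n, b = 1.5, 1.0
    for t in (0.4, 1.0, 2.0, 2.8):
        x, y = ellipse_point(t, n, b)
        print(x + n * math.hypot(b - x, y), sine_ratio(x, y, n, b))
```

Each line should print values close to nb and n respectively. The vertices on the x-axis are excluded from the samples because both sines vanish there.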



The reader should compare this result with that of the form of the perfect reflecting 
mirror already cited in [5]. If n < 0, then we get reflection instead of refraction. 

Maesumi [11] used Fermat's Principle to treat this case in a very elegant paper, although
his definition of the index of refraction is the reciprocal of our (standard) one.

2.3 Both Fixed Points are at Infinity

Keeping the notation of the case of the radiant point at $-\infty$, we assume that the refracted
rays form a parallel beam in the direction such that

\[
\theta_2 + \phi = \text{constant},
\]

but this means that

\[
\sin(\theta_2 + \phi) = \frac{b - x}{\sqrt{(b - x)^2 + y^2}} = C,
\]

where C is some constant. But the condition that the curve goes through the origin means that

\[
C = 1,
\]

and rationalizing the resulting equation we obtain:

Theorem 5. If both fixed points are at infinity, then the perfect lens C has the equation:

\[
x = 0. \tag{2.3.1}
\]

That is, it is the vertical y-axis.

□



3 Descartes' Theorem 



3.1 Drucker's Characterization of a Surface of Revolution 

In 1992, Drucker [5] published a very interesting paper in which he treated the problem
of finding all perfect mirrors, i.e., mirrors which reflect all rays issuing from one radiant
point to one image point.

After showing that the two dimensional curve with the perfect reflecting property is a 
proper or degenerate conic section, he (implicitly) proved the following characterization of a 
surface of revolution. Drucker, himself, did not state it explicitly. 

Theorem 6. Let F and F' be two fixed points. If, for each point P of the smooth connected 
surface S the normal N at P lies in the subspace spanned by the vectors FP and F'P, then 
S is a surface of revolution whose axis of revolution is the line through F and F'. 

Proof. We offer a new proof of Drucker's theorem. It is based on an idea in Salmon
[15] which goes back to Monge [12].

Since, by definition, the normal MN is in the subspace spanned by FP and F'P, it is in
the plane of $\triangle FPF'$.

If MN is always parallel to FF', then S is a plane which is perpendicular to FF'. We
exclude this degenerate case for the rest of the argument. (See (2.3.1).)

Therefore MN is not always parallel to FF'. Thus, the infinite line MN intersects FF'
at some point. This is the characteristic property of the surface S.

Let $(\alpha, \beta, \gamma)$ be a point on FF' and let $(l, m, n)$ be the line's direction numbers, where we
assume $l \cdot m \cdot n \neq 0$. The corollaries deal with the case where one or more coefficients are
equal to zero. Then, the equation of the line FF' is



x — a 
I 



V-P 



m 



z — 7 



n 



where t is the common value of the three fractions. 
Let 

F(x,y,z) = 



(3.1.1) 



(3.1.2) 



be the equation of S, where F is a continuously differentiable function of x, y, and z in some 
open set R, and let (xo>2/o> z o) be the point P on S.. 



Since MN is normal to S at P, its equation is:

\[
\frac{x - x_0}{F_x(x_0, y_0, z_0)} = \frac{y - y_0}{F_y(x_0, y_0, z_0)} = \frac{z - z_0}{F_z(x_0, y_0, z_0)} = T,
\tag{3.1.3}
\]

where T is the common value of the three fractions, and where $F_x(x_0, y_0, z_0)$ is
$\partial F / \partial x$ evaluated at $(x_0, y_0, z_0)$, and where the other denominators have a similar interpretation. We assume
that all three denominators are different from zero. The corollaries deal with the cases where
the denominators are equal to zero.

Solving equations (3.1.1) and (3.1.3) for x, y, and z, and then equating the values obtained,
we get the following homogeneous linear system for the unknowns t, T, and 1:

\[
\begin{aligned}
l\,t - F_x(x_0, y_0, z_0)\,T + (\alpha - x_0) \cdot 1 &= 0 \\
m\,t - F_y(x_0, y_0, z_0)\,T + (\beta - y_0) \cdot 1 &= 0 \\
n\,t - F_z(x_0, y_0, z_0)\,T + (\gamma - z_0) \cdot 1 &= 0
\end{aligned}
\]

The analytical condition that this system have a nontrivial solution, which it does by assumption,
is that the determinant of its coefficients vanish:

\[
\begin{vmatrix}
F_x(x_0, y_0, z_0) & F_y(x_0, y_0, z_0) & F_z(x_0, y_0, z_0) \\
l & m & n \\
x_0 - \alpha & y_0 - \beta & z_0 - \gamma
\end{vmatrix}
= 0.
\tag{3.1.4}
\]



The determinant on the left-hand side of (3.1.4) is (one half of) the Jacobian of the three
functions

\[
F(x, y, z), \qquad u := lx + my + nz, \qquad v := (x - \alpha)^2 + (y - \beta)^2 + (z - \gamma)^2,
\tag{3.1.5}
\]

evaluated at the point P of S.

But the point P is totally arbitrary, which means the Jacobian (3.1.4) vanishes in a full
neighborhood of P, since F is a continuously differentiable function of x, y, and z in some
open set R. According to a classical theorem (see Buck [2], Goursat [6], Osgood [13],
and Taylor [17]), if the Jacobian of the three functions vanishes identically, then the three
functions are functionally dependent.

That means that there is a function, Q(u, v), of the two variables u and v, defined and
continuously differentiable in a neighborhood of the point $(u_0, v_0)$, where

\[
u_0 := l x_0 + m y_0 + n z_0, \qquad v_0 := (x_0 - \alpha)^2 + (y_0 - \beta)^2 + (z_0 - \gamma)^2,
\]

for which the equation

\[
F(x, y, z) = Q\left[\,lx + my + nz,\ (x - \alpha)^2 + (y - \beta)^2 + (z - \gamma)^2\,\right]
\tag{3.1.6}
\]

holds identically in a neighborhood of $(x_0, y_0, z_0)$.
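
The easy converse direction can be checked numerically: if F has the form Q(u, v), then the determinant (3.1.4) vanishes at every point, because the gradient of F is a linear combination of $(l, m, n)$ and $(x - \alpha, y - \beta, z - \gamma)$. The sketch below uses central finite differences for the gradient. The particular Q, the axis data and all function names are arbitrary illustrative choices, not taken from the paper.

```python
import math

ALPHA, BETA, GAMMA = 0.5, -1.0, 2.0   # a point on the axis (sample values)
L, M, N = 1.0, 2.0, -1.0              # direction numbers of the axis (sample values)

def F(x, y, z):
    """A sample function of the form Q(u, v) with Q(u, v) = v - 2 - sin(u)."""
    u = L * x + M * y + N * z
    v = (x - ALPHA) ** 2 + (y - BETA) ** 2 + (z - GAMMA) ** 2
    return v - 2.0 - math.sin(u)

def gradient(f, x, y, z, h=1e-5):
    """Central finite-difference approximation of the gradient of f."""
    return (
        (f(x + h, y, z) - f(x - h, y, z)) / (2 * h),
        (f(x, y + h, z) - f(x, y - h, z)) / (2 * h),
        (f(x, y, z + h) - f(x, y, z - h)) / (2 * h),
    )

def det3(r1, r2, r3):
    """Determinant of the 3x3 matrix with rows r1, r2, r3."""
    return (
        r1[0] * (r2[1] * r3[2] - r2[2] * r3[1])
        - r1[1] * (r2[0] * r3[2] - r2[2] * r3[0])
        + r1[2] * (r2[0] * r3[1] - r2[1] * r3[0])
    )

def normal_axis_determinant(x, y, z):
    """Rows: grad F, axis direction, vector from the axis point to (x, y, z)."""
    return det3(gradient(F, x, y, z), (L, M, N), (x - ALPHA, y - BETA, z - GAMMA))

if __name__ == "__main__":
    for p in [(0.3, 0.7, -1.2), (2.0, -0.5, 0.1)]:
        print(normal_axis_determinant(*p))
```

The printed determinants should be zero up to finite-difference error.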
Now, the equation

\[
lx + my + nz = u \tag{3.1.7}
\]

represents a plane which cuts the line FF' (represented by (3.1.1)) perpendicularly, while
the equation

\[
(x - \alpha)^2 + (y - \beta)^2 + (z - \gamma)^2 = v \tag{3.1.8}
\]

represents a sphere of radius $\sqrt{v}$ and with center $(\alpha, \beta, \gamma)$ on the line FF'.

The points (x, y, z) which are on the plane, (3.1.7), and on the sphere, (3.1.8), simultaneously,
are on their circle of intersection and this circle has its center on the line FF'.

Therefore, the equation (3.1.2) of S, i.e., Q(u, v) = 0, represents a surface generated
by a circle of variable radius whose center moves along the line FF' and whose plane is
perpendicular to that line.

Thus, every planar transverse section of S, perpendicular to FF', consists of one or more 
circles whose centers are on the line FF'. 



That is, S is a surface of revolution with axis FF'. 
This completes the proof of Drucker's theorem. 



□ 



Corollary 1. If the z-axis is the axis of revolution, we may take the origin as the point
$(\alpha, \beta, \gamma)$, and the equation (3.1.2) becomes

\[
F(x, y, z) = Q\{z,\ x^2 + y^2 + z^2\}. \tag{3.1.9}
\]

□

There are similar simplifications in (3.1.6) if we take one of the other coordinate axes as the
axis of revolution.

Corollary 2. If $F_x(x_0, y_0, z_0) = 0$, then the normal is everywhere perpendicular to the x-axis
and the equation (3.1.2) becomes the cylinder of revolution:

\[
F(x, y, z) = Q\{my + nz,\ (y - \beta)^2 + (z - \gamma)^2\}. \tag{3.1.10}
\]

□

There are similar simplifications if the other components of the normal are zero. 



Remark 1. In order to apply the theorem on functional dependence which we used in
the above proof, we have to make sure that we comply with all the hypotheses. The only
one which we did not explicitly state in the body of the proof is, using the notations of
(3.1.7) and (3.1.8), that at least one of the three Jacobians

\[
\frac{\partial(u, v)}{\partial(x, y)}, \qquad \frac{\partial(u, v)}{\partial(y, z)}, \qquad \frac{\partial(u, v)}{\partial(x, z)}
\tag{3.1.11}
\]

is different from zero at $(x_0, y_0, z_0)$.

We claim that even more is true in our case. We will prove that at least two of the
Jacobians (3.1.11) are different from zero.

Suppose, to the contrary, that at least two of them are equal to zero, say

\[
\frac{\partial(u, v)}{\partial(x, y)} = 0, \qquad \frac{\partial(u, v)}{\partial(y, z)} = 0.
\tag{3.1.12}
\]

This leads to

\[
\frac{l}{x - \alpha} = \frac{m}{y - \beta}, \qquad \frac{m}{y - \beta} = \frac{n}{z - \gamma},
\tag{3.1.13}
\]

respectively. By (3.1.13),

\[
\frac{l}{x - \alpha} = \frac{m}{y - \beta} = \frac{n}{z - \gamma},
\tag{3.1.14}
\]

which is the equation of the axis FF'. But this means that S is just the straight line axis,
which is excluded by the hypothesis that S is a smooth surface. Therefore, at least two of
the Jacobians (3.1.11) are different from zero and the theorem on functional dependence is
applicable.

Remark 2. The proof shows that the characteristic property of a surface S of revolution is 
that the normal to any point of S intersects the axis of revolution. 



3.2 Proof of Descartes' Theorem 

We adapt Drucker's definition:

Definition 3. Let S be a smooth connected surface and let F and F' be points not in S. We 
say that S is a perfect lens relative to F and F' if, for each point P in S: 

1. the normal N at P lies in the subspace spanned by the vectors FP and F'P, and

2. the sines of the angles which FP and F'P form with that normal are in constant
ratio for every point P in S.

By condition 2 of the definition, the cross-section of S sliced out by the xy-plane is a
plane curve C which is a perfect two-dimensional lens relative to F and F'.

That means that C is either (part of) a cartesian oval, or (part of) a conic section, 
or a degenerate case of either one. 

Therefore, by condition 1 and Drucker's Theorem, a three-dimensional perfect lens S
is (part of) a surface of revolution with axis FF' obtained by rotating a two-dimensional
perfect lens C around it.

This completes the proof of Descartes' Theorem. 

□ 
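
Both conditions of Definition 3 can be verified numerically on an ovoid. The sketch below takes the surface $|FP| + n\,|F'P| = c$ with F at the origin and F' = (b, 0, 0), approximates the normal by a finite-difference gradient, and checks that the normal is coplanar with FP and F'P (vanishing triple product) and that the ratio of the two sines equals n. The constants n = 1.5, b = 1, c = 3 and all function names are illustrative assumptions.

```python
import math

N_INDEX, B, C = 1.5, 1.0, 3.0   # sample index, focal distance and constant

def G(x, y, z):
    """Weighted distance sum minus C; the ovoid is the level set G = 0."""
    return math.sqrt(x * x + y * y + z * z) + N_INDEX * math.sqrt((x - B) ** 2 + y * y + z * z) - C

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1], a[2] * b[0] - a[0] * b[2], a[0] * b[1] - a[1] * b[0])

def norm(a):
    return math.sqrt(dot(a, a))

def surface_point(polar, azimuth):
    """Point of the ovoid on the ray from the origin with the given spherical angles."""
    d = (math.cos(polar), math.sin(polar) * math.cos(azimuth), math.sin(polar) * math.sin(azimuth))
    lo, hi = 0.0, C             # G < 0 at the origin and G > 0 at distance C for these values
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if G(mid * d[0], mid * d[1], mid * d[2]) > 0:
            hi = mid
        else:
            lo = mid
    r = 0.5 * (lo + hi)
    return (r * d[0], r * d[1], r * d[2])

def gradient(p, h=1e-6):
    """Central finite-difference gradient of G, i.e. a normal vector of the level set."""
    x, y, z = p
    return (
        (G(x + h, y, z) - G(x - h, y, z)) / (2 * h),
        (G(x, y + h, z) - G(x, y - h, z)) / (2 * h),
        (G(x, y, z + h) - G(x, y, z - h)) / (2 * h),
    )

def lens_conditions(p):
    """Return (triple product, sine ratio) for conditions 1 and 2 at the point p."""
    nrm = gradient(p)
    fp = p                                  # vector FP, with F at the origin
    fpp = (p[0] - B, p[1], p[2])            # vector F'P
    triple = dot(nrm, cross(fp, fpp))       # zero iff the normal lies in span{FP, F'P}
    sin1 = norm(cross(nrm, fp)) / (norm(nrm) * norm(fp))
    sin2 = norm(cross(nrm, fpp)) / (norm(nrm) * norm(fpp))
    return triple, sin1 / sin2

if __name__ == "__main__":
    for polar in (0.5, 1.2, 2.0):
        print(lens_conditions(surface_point(polar, 0.3)))
```

Points on the axis itself are avoided in the samples, since there both sines vanish and the ratio is undefined.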